81 research outputs found

    A mixture model for co-clustering a continuous data table (Un modèle de mélange pour la classification croisée d'un tableau de données continue)

    National audience. Unlike conventional clustering methods, co-clustering methods process the set of rows and the set of columns of a data table simultaneously, seeking homogeneous blocks. In this article, we address co-clustering when the data table describes a set of individuals by quantitative variables. To that end, we propose a mixture model adapted to co-clustering, which leads to original criteria that can account for more complex situations than the criteria usually used in this context. The parameters are then estimated by a generalized EM (GEM) algorithm that maximizes the likelihood of the observed data. We also propose a new expression of the Bayesian information criterion, called BIC_B, adapted to our setting, to assess the number of blocks. Numerical experiments on synthetic data evaluate the performance of GEM and BIC_B and show the value of this approach.

    Exploring Topic Variants Through an Hybrid Biclustering Approach

    In large text corpora, analytic journalists need to identify facts, verify them by locating corroborating documents, and survey all related viewpoints. This requires them to make sense of document relationships at two levels of granularity: high-level topics and low-level topic variants. We propose visual analytics software that allows analytic journalists to verify and refine hypotheses without having to read all documents. Our system relies on a hybrid biclustering approach. A new Topic Weighted Map visualization conveys all top-level topics, reflecting their importance and their relative similarity. Coordinated multiple views then allow the analyst to drill down into topic variants through an interactive term hierarchy visualization. The analyst can thus select, compare, and filter out the subtle co-occurrences of terms shared by multiple documents to find interesting facts or stories. The usefulness of the tool is shown through a usage scenario and further assessed through a qualitative evaluation by an expert user.

    Generalized topographic block model

    Co-clustering leads to parsimony in data visualisation with a number of parameters dramatically reduced in comparison to the dimensions of the data sample. Herein, we propose a new generalized approach for nonlinear mapping by a re-parameterization of the latent block mixture model. The densities modeling the blocks are in an exponential family such that the Gaussian, Bernoulli and Poisson laws are particular cases. The inference of the parameters is derived from the block expectation–maximization algorithm with a Newton–Raphson procedure at the maximization step. Empirical experiments with textual data validate the interest of our generalized model

    Block Mixture Model for the Biclustering of Microarray Data

    This publication is a representation of what appears in the IEEE Digital Libraries. International audience. An attractive way to bicluster genes and conditions is to adopt a Block Mixture Model (BMM). Approaches based on a BMM rely on a Block Expectation Maximization (BEM) algorithm and/or a Block Classification Expectation Maximization (BCEM) algorithm. The drawback of these approaches is the difficulty of choosing a good initialization strategy for the BEM and BCEM algorithms. This paper reviews existing biclustering approaches that adopt a BMM and proposes a new fuzzy biclustering approach. Our approach makes it possible to choose a good initialization strategy for the BEM and BCEM algorithms.

    A merchant website's privacy policy as a means of strengthening online consumers' trust (La politique de confidentialité d’un site marchand en tant que moyen pour renforcer la confiance des consommateurs en ligne)

    Whether regarded as a medium or as a place of purchase, the internet continues to raise trust issues for online consumers with regard to commercial transactions. For this reason, winning the trust of online shoppers is becoming essential for companies specializing in e-commerce. In an effort to build consumer trust in the internet, and in merchant websites in particular, many tools have been put in place, such as trust seals as well as privacy policies for the protection of personal data and respect for privacy. The aim of this article is to identify, through a qualitative study, the means by which a merchant website's privacy policy can affect consumer trust.


    Whether regarded as a medium or as a place of purchase, the internet continues to raise trust issues for online consumers with regard to commercial transactions. In this context, winning the trust of internet users appears to be a priority for companies doing business online. In an attempt to increase consumer trust in the internet, and in merchant websites in particular, many measures have been developed and implemented, such as trust seals as well as policies for protecting personal data and respecting privacy. The theoretical contribution of this research lies at three levels. First, despite a fairly substantial body of work on trust in marketing, little research has examined trust in consumer behavior. Indeed, the literature shows that inter-firm relationships have been the preferred field of application for this research. The aim of this article is to identify, through an exploratory quantitative study, the determinants of consumer trust when purchasing on the Internet, as well as the measures taken to establish it, by means of a survey of a sample of 200 people.

    Model-based Co-clustering for High Dimensional Sparse Data

    We propose a novel model based on the von Mises-Fisher (vMF) distribution for co-clustering high dimensional sparse matrices. While existing vMF-based models are only suitable for clustering along one dimension, our model acts simultaneously on both dimensions of a data matrix. It thereby has the advantage of exploiting the inherent duality between rows and columns. Setting our model under the maximum likelihood (ML) approach and the classification ML (CML) approach, we derive two novel co-clustering algorithms, one hard and one soft. Empirical results on numerous synthetic and real-world text datasets demonstrate the effectiveness of our approach for modelling high dimensional sparse data and for co-clustering. Furthermore, because our formulation performs an implicitly adaptive dimensionality reduction at each stage, our model alleviates the problem of high concentration parameters (kappa), a well-known difficulty in classical vMF-based models.

    The ARIA-MASK-air® approach

    Funding Information: The authors thank Ms Véronique Pretschner for submitting the paper. MASK-air has been supported by Charité Universitätsmedizin Berlin, EU grants (EU Structural and Development Funds Languedoc Roussillon and Region PACA; POLLAR: EIT Health; Twinning: EIP on AHA; Twinning DHE: H2020; Catalyse: Horizon Europe) and educational grants from Mylan-Viatris, ALK, GSK, Novartis, Stallergènes-Greer and Uriach. None for the study. Publisher Copyright: © 2023 The Authors. Clinical and Translational Allergy published by John Wiley & Sons Ltd on behalf of European Academy of Allergy and Clinical Immunology. MASK-air®, a validated mHealth app (Medical Device regulation Class IIa) has enabled large observational implementation studies in over 58,000 people with allergic rhinitis and/or asthma. It can help to address unmet patient needs in rhinitis and asthma care. MASK-air® is a Good Practice of DG Santé on digitally-enabled, patient-centred care. It is also a candidate Good Practice of OECD (Organisation for Economic Co-operation and Development). MASK-air® data has enabled novel phenotype discovery and characterisation, as well as novel insights into the management of allergic rhinitis. MASK-air® data show that most rhinitis patients (i) are not adherent and do not follow guidelines, (ii) use as-needed treatment, (iii) do not take medication when they are well, (iv) increase their treatment based on symptoms and (v) do not use the recommended treatment. The data also show that control (symptoms, work productivity, educational performance) is not always improved by medications. A combined symptom-medication score (ARIA-EAACI-CSMS) has been validated for clinical practice and trials. The implications of the novel MASK-air® results should lead to change management in rhinitis and asthma.

    Rhinitis associated with asthma is distinct from rhinitis alone: The ARIA-MeDALL hypothesis

    Asthma, rhinitis, and atopic dermatitis (AD) are interrelated clinical phenotypes that partly overlap in the human interactome. The concept of “one-airway-one-disease,” coined over 20 years ago, is a simplistic approach to the links between upper- and lower-airway allergic diseases. With new data, it is time to reassess the concept. This article reviews (i) the clinical observations that led to Allergic Rhinitis and its Impact on Asthma (ARIA), (ii) new insights into polysensitization and multimorbidity, (iii) advances in mHealth for novel phenotype definitions, (iv) confirmation in canonical epidemiologic studies, (v) genomic findings, (vi) treatment approaches, and (vii) novel concepts on the onset of rhinitis and multimorbidity. One recent concept, bringing together upper- and lower-airway allergic diseases with skin, gut, and neuropsychiatric multimorbidities, is the “Epithelial Barrier Hypothesis.” This review determined that the “one-airway-one-disease” concept does not always hold true and that several phenotypes of disease can be defined. These phenotypes include an extreme “allergic” (asthma) phenotype combining asthma, rhinitis, and conjunctivitis.
